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Abstract. We provide convergent hierarchies for the convex cone C of copos- 
itive matrices and its dual C* , the cone of completely positive matrices. In 
both cases the corresponding hierarchy consists of nested spectrahedra and 
provide outer (resp. inner) approximations for C (resp. for its dual C*), thus 
complementing previous inner (resp. outer) approximations for C (for C*). In 
particular, both inner and outer approximations have a very simple interpre- 
tation. Finally, extension to /C-copositivity and /C-complete positivity for a 
closed convex cone /C, is straightforward. 



1. Introduction 

In recent years the convex cone C of copositive matrices and its dual cone C* 
of completely positive matrices have attracted a lot of attention, in part because 
several interesting NP-hard problems can be modelled as convex conic optimization 
problems over those cones. For a survey of results and discussion on C and its dual, 
the interested reader is referred to e.g. Anstreicher and Burer |T], Bomze [2J, Bomzc 
et al. |3_, Burer 5 , Diir 8 and Hiriart-Urruty and Seeger [IT].. 

As optimizing over C (or its dual) is in general difficult, a typical approach is to 
optimize over simpler and more tractable cones. In particular, nested hierarchies of 
tractable convex cones Ck, k G N, that provide inner approximations of C have been 
proposed, notably by Parrilo [13], deKlerk and Pasechnik [7], Bomze and deKlerk 
[4j, as well as Pefia et al. [14]. For example, denoting by Af (resp. S+) the convex 
cone of nonnegative (resp. positive semidefinite) matrices, the first cone in the 
hierarchy of [7J is Af, and Af + S+ in that of [13] , whereas the hierarchy of [M] is in 
sandwich between that of [7] and [13]. Of course, to each such hierarchy of inner 
approximations (Ck), k G N, of C, one may associate the hierarchy (C£), fc e N, of 
dual cones which provides outer approximations of C*. However, quoting Diir in 
[8] : "We are not aware of comparable approximation schemes that approximate the 
completely positive cone (i.e. C*) from the interior." 

Let us also mention Gaddum's characterization of copositive matrices described 
in Gaddum [9] for which the set membership problem "A € C" (with A known) re- 
duces to solving a linear programming (LP) problem (whose size is not polynomially 
bounded in the input size of the matrix A). However, Gaddum's characterization 
is not convex in the coefficients of the matrix A, and so cannot be used to provide 
a hierarchy of outer approximations of C. On the other hand, de Klerk and Pasech- 
nik [6] have used Gaddum's characterization to help solve quadratic optimization 
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problems on the simplex via solving a hierarchy of LP-relaxations of increasing size 
and with finite convergence of the process. 

Contribution. The contribution of this note is precisely to describe an explicit 
hierarchy of tractable convex cones that provide outer approximations of C which 
converge monotonically and asymptotically to C. And so, by duality, the corre- 
sponding hierarchy of dual cones provides inner approximations of C* converging 
to C* , answering Diir's question and also showing that C and C* can be sandwiched 
between two converging hierarchies of tractable convex cones. The present outer 
approximations are not polyhedral and hence are different from the outer polyhe- 
dral approximations 0™ of C defined in Yildirim [20] . 0™ are based on a certain 
discretization A(n, r) of the simplex A„ := {x <E Wt ■ e T x = 1} whose size is 
parametrized by r. All matrices associated with quadratic forms nonnegative on 
A(n,r) belong to O r n for all r, and C — C\^° =l O r n ; Combining with the inner ap- 
proximations of de Klerk and Pasechnik [7], the author provides tight bounds on 
the gap between upper and lower bounds for quadratic optimization problems; for 
more details the interested reader is referred to Yildirim |20) . 

In fact, our result is a specialization of a more general result of |12j about nonneg- 
ativity of polynomials on a closed set K C M™ , in the specific context of quadratic 
forms and K = R™. In such a context, and identifying copositive matrices with 
quadratic forms nonnegative on K = , this specialization yields inner approxi- 
mations for C* with a very simple interpretation directly related to the definition of 
C* , which might be of interest for the community interested in C and C* but might 
be "lost" in the general case treated in [T2], whence the present note. Finally, fol- 
lowing Burer [5], and K. C M™ being a closed convex cone, one may also consider the 
convex cone Cjc of /C-copositive matrices, i.e., real symmetric matrices A such that 
x T Ax > on /C, and its dual cone C£ of /C-completely positive matrices. Then the 
outer approximations previously defined have an immediate and straightforward 
analogue (as well as the inner approximations of the dual). 

2. Main result 

2.1. Notation and definitions. Let R[x] be the ring of polynomials in the vari- 
ables x = (xi, . . . , x n ). Denote by E[x]d C M[x] the vector space of polynomials of 
degree at most d, which forms a vector space of dimension s(d) = with e.g., 

the usual canonical basis (x a ) of monomials. Also, let := {a £ N n : J^i a i — ^} 
and denote by E[x] C R[x] (resp. £[x]d C M[x]2d) the space of sums of squares 
(s.o.s.) polynomials (resp. s.o.s. polynomials of degree at most 2d). If / e K[x]d, 
write /(x) = X^eN" /a x " m the canonical basis and denote by f = (f a ) e W^ d>> 
its vector of coefficients. Finally, let S n denote the space ofnxn real symmetric 
matrices, with inner product (A,B) = trace AB, and where the notation A >z 
(resp. A >- 0) stands for A is positive semidefinite. 

Given K C W l , denote by clK (resp. convK) the closure (resp. the convex 
hull) of K. Recall that given a convex cone K C R n , the convex cone K* ={y£ 
R n : (y,x) > Vx e K} is called the dual cone of K, and satisfies (K*)* = clK . 
Moreover, given two convex cones Ki, K2 C W l , 

K* n K2 = (K!+K 2 )* = (K!UK 2 )* 
(Ki n K 2 )* = cl(KJ+K*) = cl(conv(K* UK*)). 

See for instance Rockafellar [T31 Theorem 3.8; Corollary 16.4.2]. 
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Moment matrix. With a sequence y = (y a ), a £ N n , let L y : R[x] — !> R be the 
linear functional 

h (=^/i Q x Q ) h. L y {h) = ^h a y ot , /ieR[x]. 

a a 

With rf 6 N, let M d (y) be the symmetric matrix with rows and columns indexed 
in N2, and defined by: 

(2.1) M d (y)(a,p) := L y {^) = y a+f} , (a, (3) eE n d x NJ. 

The matrix M<;(y) is called the moment matrix associated with y, and it is 
straightforward to check that 

[L y (g 2 )>0 V S 6l[x]] & M d (y) h 0, d = 0,l,.... 
Localizing matrix. Similarly, with y = (y a ), a £ N n , and / € R[x] written 

XH- /(x) = ^ / 7 x 7 , 

let M d (/y) be the symmetric matrix with rows and columns indexed in N^, and 
defined by: 

(2.2) M d (/y)(a,/?) := L y (/(x)x^) = ^/ 7 y Q+/3+7 , (a, /?) G x NJ. 

7 

The matrix Md(/y) is called the localizing matrix associated with y and / e R[x]. 
Observe that 

(2-3) (g,M d (/y)g) = L y (g 2 f), V 5 G R[x] d , 

and so if y has a representing finite Borel measure [A, i.e., if 

y a = J x a dfi, Va G N", 

then (|2.3p reads 

(2.4) (g, M d (/ y)g) = L y (g 2 /) = 5 (x) 2 /(x) d/i(x), V.g G R[x] d . 

Actually, the localizing matrix M d (/y) is nothing less than the moment matrix 
associated with the sequence z = fy = (z a ), a £ N", with z a — fiVa+j- 
In particular, if / is nonnegative on the support supp /i of \x then the localizing 
matrix M d (/ y) is just the moment matrix associated with the finite Borel measure 
dfif — fdfi, absolutely continuous with respect to /i (denoted fj,f -C fJ>), and with 
density /. For instance, with d — 1 one may write 



1 



Mi(/y) = 
or, equivalent ly, 

(2.5) Mi(/y) = mass( M /) x 



/(x) dfx(x) 



d/x/(x), 



1 I E A/ (xf 
E A/ (x) | E A/ (xx T ) 



where E A/ (•) denotes the expectation operator associated with the normalization 
jlf of [if (as a probability measure), and E A/ (xx T ) denotes the matrix of noncentral 
second-order moments of jjf (and the covariance matrix of jXf if E A , (x) = 0). 
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2.2. Main result. In [T2l Theorem 3.2] the author has shown in a general context 
that a polynomial / € K[x] is nonnegative on a closed set K C R n if and only if 



(2.6) / g z fdfi>0, Vj£R[x], 

JK 

where fi is a given finite Borel measure with support supp/i being exactly K; if K 
is compact then \x is arbitrary whereas if K is not compact then fj, has to satisfy a 
certain growth condition on its moments. If y = (y a ), a G N™, is the sequence of 
moments of /z then (|2.6|) is in turn equivalent to 

M d (/y) h 0, Vd = 0,l,..., 

where Md(/y) is the localizing matrix associated with / and y, defined in (|2.2|) . 
In this section we particularize this result to the case of copositive matrices viewed 
as homogeneous forms of degree 2, nonnegative on the closed set K = W±_. 

So with A = (ay) G S n , let denote by /a G M[x] 2 the quadratic form x M> x T Ax, 
and let /x be the joint probability measure associated with n i.i.d. exponential 
variates (with mean 1), with support supp/x = R", and with moments y = (y a ), 
a G N n , given by: 



(2.7) y a = x a d/i(x) = / x Q expC-Vx^dx = J] a,!, 



Va G N n . 



Recall that a matrix A G 5™ is copositive if /a(x) > for all x G R™ , and denote 
by C C 5" the cone of copositive matrices, i.e., 

(2.8) C := { A G S n : / A (x) > Vx G }. 

Its dual cone is the closed convex cone of completely positive, i.e., matrices of 5™ that 
can be written as the sum of finitely many rank-one matrices xx T , with x G R" , 
i.e., 

(2.9) C* = conv{ xx T : x G R" }. 

Next, introduce the following sets Cd G S n , d — 0, 1, . . ., defined by: 

(2.10) C d := { AGS" : M d (/ A y) h 0}, d = 0, 1, . . . 

where Mj(/ay) is the localizing matrix defined in (12.21) . associated with the qua- 
dratic form /a and the sequence y in (|2.7[) . 

Observe that in view of the definition (12.21) of the localizing matrix, the entries of 
the matrix Md(/A.y) are homogeneous and linear in A. Therefore, the condition 
M<j(/A y) is a homogeneous Linear Matrix Inequality (LMI) and defines a 
closed convex cone (in fact a spectrahedron) of S n (or equivalently, of ]R n ( ,l + 1 )/ 2 ). 
EachCd C S n is a convex cone defined solely in terms of the entries (a.^ ) of A G S n , 
and the hierarchy of spectrahedra (Cd), d G N, provides a nested sequence of outer 
approximations of C. 

Theorem 2.1. Let y be as in and let Cd C S" , d — 0,1, . . ., be the hierarchy 

oo 

of convex cones defined in \2.10\) . Then Cq D C\ ■ ■ ■ D Cd ■ ■ ■ D C and C — (~} Cd- 

d=0 
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The proof is a direct consequence of [12j Theorem 3.3] with K = R™ and f = /a- 
Since /a is homogeneous, alternatively one may use the probability measure v 
uniformly supported on the n-dimensional simplex A = {x£ K™ : ^\ Xi < 1} and 
invoke [121 Theorem 3.2]. The moments y = (y a ) of v are also quite simple to 
obtain and read: 



(2.11) 



Va 



I. .., 



Va G N n . 



(See e.g. Grundmann [TO].) 

Observe that the set membership problem "A G C^" , i.e., testing whether a given 
matrix A G <S ra belongs to Cd, is an eigenvalue problem as one has to check whether 
the smallest eigenvalue of M^(/a y) is nonnegative. Therefore, instead of using 
standard packages for Linear Matrix Inequalities, one may use powerful specialized 
softwares for computing eigenvalues of real symmetric matrices. 

We next describe an inner approximation of the convex cone C* via the hierarchy 
of convex cones (C*i), d G N, where each C* d is the dual cone of Cd in Theorem 12. II 

Recall that E[x]d is the space of polynomials that are sums of squares of poly- 
nomials of degree at most d. A matrix A € S n is also identified with a vector 
a £ JR"( n + 1 )/ 2 j and conversely, with any vector a € K n (™ +1 )/ 2 is associated a matrix 
A G S n . For instance, with n = 2, 



(2.12) 



a b 
b c 



a 
2b 

c 



So we will not distinguish between a convex cone in K n ( n+1 )/ 2 and the corresponding 
cone in S n . 



Theorem 2.2. Let Cd C S n be the convex cone defined in H2.10]) . Then 
(2-13) C:=cl{«X,M d (^y))) 1 < l < J <„ : XeSf}. 



Equivalently: 



C* d = cl 



(2.14) = cl {mass(/i (T )E^(xx T ) : a G E[x] d } , 

with fx a and E^ ltT (xx T ) as in i2.5\) . 
Proof. Let 

A d := {((X^^y)))^.^ : XeSf}, 
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so that 

A* d = <j a e M" (n+1)/2 : £ o« <X, M d (x iXj y) > VXe5f 

a € R'^+D/ 2 : ( X, M rf ( ( ]T a^a^) y ] \ > VXe S s + {d) 

l<i<j<n J I 

= {A e S n : (X, M d (f A y)> > VXe S 6 + {d) } [with A, a as in (l2"T2) ] 
= [AeS" : M d (/ A y) h } = C d . 

And so we obtain the desired result C d — (A^)* = cl(Ad). Next, writing the 
singular decomposition of X as J2k=o t lk <lfc f° r some s S N and some vectors 
(qfc) C R s ( d ', one obtains that for every 1 < i < j < n, 

s s 

(X,M d (xiX jy )) = ^(q^.M^y)) = q fc , M^x^- y)q fc ) 

fc=0 fe=0 

= ^2 x lXj q k {x) 2 dfi{x) [by 
XiXj ct(x) ei^(x), 

d/i„(x) 

where ct(x) = X)fc=o *( x ) 2 £ S[x]d, and fi a {B) = J B ad/j, for all Borel sets B. □ 



So Theorem 12.21 states that C d is the closure of the convex cone generated by 
second-order moments of measures d\x a = ad^i, absolutely continuous with respect 
to [i (hence with support on R™ ) and with density being a s.o.s. polynomial a of 
degree at most 2c?. Of course we immediately have: 

Corollary 2.3. Let de N, be as in [2l4\ >. Then C* d C C* d+1 for all deN, and 

OO 

C* = cl |J C* d . 
Proof. As C d C C d+1 for all d G BJ, the result follows from 

Coo \ * / oo \ oo 

fl CA = cl conv |J C* d = cl |J C* d . 
d=0 J \ d=0 I d=0 

□ 

In other words, C d approximates C* — conv {xx T :xel™} (i.e., the convex hull 
of second-order moments of Dirac measures with support in R™ ) from inside by 
second-order moments of measures \i a <C fi whose density is a s.o.s. polynomial a 
of degree at most 2d, and better and better approximations are obtained by letting 
d increase. 



Example 1. For instance, with n = 2, and A 



it is known that A 



a b 
b c 

is copositive if and only if a, c > and b + y/ac > 0; see e.g. [IT]. Let fi be 
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Figure 1. n — 2: Projection on the (a, 6)-plane of C versus C\, 
both intersected with the unit ball 



the exponential measure on with moments defined in (|2.7p . With /a(x) := 



2bx\X2 + cccj, and d = 1, the condition M^(/a y) h which reads 



(2.15) 



a + 6 + c 
3a + 26 + c 
a + 26 + 3c 



3a + 26 + c 
12a + 66+ 2c 
3a + 46 + 3c 



a + 26 + 3c 
3a + 46 + 3c 
2a + 66+ 12c 



defines the convex cone C\ Cl 3 . It is a connected component of the basic semi- 
algebraic set {(a, 6, c) : det (Mx(/ay)) > 0}, that is, elements (a, 6, c) such that: 

(2.16) 3a 3 + 15a 2 6 + 29a 2 c+16a6 2 + 50a6c + 29ac 2 + 46 3 + 166 2 c+156c 2 + 3c 3 > 0. 

Alternatively, by homogeneity, instead of p, we may take the probability measure v 
uniformly supported on the simplex A and with moments y = (y a ) given in (|2.11l) . 
in which case the corresponding matrix Mi(/ay) now reads: 

30(a + 6 + c) 6(3a + 26 + c) 6(a + 26 + 3c) " 
6(3a + 26 + c) 12a + 66 + 2c 3a + 46 + 3c 
6(a + 26 + 3c) 3a + 46 + 3c 2a + 66 + 12c 

It turns out that up to a multiplicative factor both matrices (|2.15|) and (|2.17|) have 
same determinant and so define the same convex cone C\ (the same connected 
component of (|2.16p ). Figure 1 below displays the projection on the (a, 6)-plane of 
the sets C\ and C intersected with the unit ball. 



(2.17) 



1 

360 



y o. 



2.3. An alternative representation of the cone C* d . From its definition (|2.13|) 
in Theorem [221 the cone C* d C S n is defined through the matrix variable X € S s ^ 
which lives in a (lifted) space of dimension s(d)(s(d) + l)/2 and with the linear 



s 



JEAN B. LASSERRE 



matrix inequality (LMI) constraint X y of size s(d). In contrast, the convex cone 
Cd C S n defined in ()2.10|) is denned solely in terms of the entries of the matrix 
A G S n , that is, no projection from a higher dimensional space (or no lifting) is 
needed. 

We next provide another explicit description on how C d can be generated with 
no LMI constraint and with only s(d) variables, but of course this characterization 
is not well suited for optimization purposes. 

Since every g G S[x]d can be written <? 2 for finitely many polynomials (gi) C 
K.[x]d, the convex cone S[x]d of s.o.s. polynomials can be written 

S[x] d = conv{<? 2 : g G K[x] d }. 

Next, for g G M[x] d , with vector of coefficients g = (g a ) G R s(d) , let g< 2 ) = (#i 2) ) G 
jjs(2d) vec ^ or f coefficients of g 2 , that is, 

«/(x)= Yl 5a x" , 9 (x) 2 = Yl 5i 2) x Q . 

(2) 

Notice that for each a G N^, <?a is quadratic in g. For instance, with n = 2 and 
d=l, g{x) = g 00 + gi Xi + goix 2 with g = (g 00 , g w , goi) T € K s(1) , and so 

g (2) = (,g 2 , 2 5o0 .gio, 2 5o0 .goi, 5io> ^9io9oi, 0o 2 ) T € K s(2) . 
Next, for 5 G M[x](j and every 1 < i < j < n, let = (Gy) € 5™ be defined by: 

(2.18) Gu := / 5 (x) 2 ^ , d^(x) = ]T ,gi 2) (a, + l)!(aj + 1)! J] a fc !, 

for all 1 < i < j < n. We can now describe C d . 
Corollary 2.4. Let G d G S n be as in WJ%\) . Then: 

(2.19) C* d = cl (conv{G d : g G } ) . 
Proof. From Theorem ET21 

C3=d{«X,M«,( a:4a:i y))) 1 < i < J < n : XeSf}. 

As in the proof of Theorem 12. 2\ writing X y as J2t=i SfeSfc f° r sorne vectors 
(git) G R s ( d ' (and associated polynomials (3%) C R[x]<j), 

( X, Md^a^- y) ) = Y x i x j9k(x) 2 dfi, = ^G-j-, 
fc=i -' ffi + fe=i 

for all 1 < i < j < n, where is as in (]2.18[) (but now associated with g^ instead 
of g). Hence 

s 

((X,M d ( Xi x jy )) = ^G* G conv{G d : g G K fl ( d > }. 

k=l 

Hence C* C cl (conv{G d : g G R s{d) }). 
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Conversely, let h G conv {G d :geR s W},i.e.,for some integer s and polynomials 
(<7fc) C K[x]d, and for all 1 < i < j < n, 



= E A * G ^ = / (E A ^fe(x)) xi 

k=lZT Jr + \fc=l / 



>0 

= (X,Md(xiXjy)) 

where X = £* =1 X k Sksl G Hence cl (conv {G d :ge M s < d ) }) C C*. □ 

The characterization (|2 . 19[) of C* d should be compared with the characterization 
conv{xx T : x G R™ } of C*. 

Example 2. Vt^it/i n = 2 and d — 1, Gi reads: 

\ 2 9qo + 12 9io(9oo + Soi) + 4g i(soo + 9oi) + 24gJ gg + 4g 00 (gi + g i) + 6(gJ + Soi) + 8 9io9oi 
L 9oo + 4 9oo(gio + soi) + 6(9i + 9oi) + 8910901 2go + 4gio(goo + 910) + 12g i(goo + gio) + 24g ! 1 

2.4. /C-copositive and /C-completely positive matrices. Let JC C R n be a 

closed convex cone and let C/c be the convex cone of /C-copositive matrices, i.e., 
matrices A £ S n such that x T Ax > on JC. Its dual cone C£ C S n is the cone of 
/C-completely positive matrices. 

Then one may define a hierarchy of convex cones (Cjc)d and (C^)d, d E N, 
formally exactly as in Theorem 12.11 and Theorem 12 .21 but now y is the moment 
sequence of a finite Borel measure [i with supp \i = JC (instead of supp /i = K" in 
(23)), i-e., y = (y a ), a G N", with: 

(2.20) y a := / x Q dp, Va G N" (and where supp n = JC) . 

And so with y as in (|2.20p Theorem 12.11 and 12.21 as well as Corollary 12.31 are 
still valid. (In ([2.141 replace J R „ with J^). Therefore, 

oo oo 

Ck = f](C K ) d and Cl = cl (JM- 

But of course, for practical implementation, one need to know the sequence y = 
(ya), a € N", which was easy when JC = R™ . For instance, if /C is a polyhedral cone, 
by homogeneity of /a, one may equivalently consider a compact base of JC (which 
is a polytope JC'), and take for /i the Lebesgue measure on JC' . Then all moments 
of [i can be calculated exactly. The same argument works for every convex cone JC 
for which one may compute all moments of a finite Borel measure /i whose support 
is a compact base of JC. 
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